350 results found.
Speech Corpus
Language Type:
Multilingual
Languages:
Arabic Czech English German French
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Improving Neural Machine Translation by Incorporating Hierarchical Subword Features
- Paper track: NLP engineering experiment paper
- Paper status: Accept Oral

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Makoto Morishita | NTT Communication Science Laboratories | JP |
| Author 2 | Jun Suzuki | NTT CS Lab. | JP |
| Author 3 | Masaaki Nagata | +81-774-93-5235 | JP |
| Main Contact | Makoto Morishita | NTT Communication Science Laboratories | None |
Documentation:
<Not Specified>
Written Corpus
Language Type:
Multilingual
Languages:
Dutch English German Spanish French
Availability:
From Owner
License:
<Not Specified>
Size:
334.4 MByte
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
- Paper track: Evaluation
- Paper status: Accept Poster

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jérémy Ferrero | Université Grenoble Alpes | FR |
| Author 2 | Frédéric Agnès | Compilatio | FR |
| Author 3 | Laurent Besacier | LIG | FR |
| Author 4 | Didier Schwab | Univ. Grenoble Alpes | FR |
| Main Contact | Jérémy Ferrero | Université Grenoble Alpes | None |
Documentation:
<Not Specified>
Written Lexicon
Language Type:
Multilingual
Languages:
English German Japanese Russian French
Availability:
Freely Available
License:
CreativeCommons-by-sa
Size:
>1,700,000 entries
Production Status:
Existing-updated
Use:
Semantic Web
- Paper title: Dbnary: Wiktionary as Linked Data for 12 Language Editions with Enhanced Translation Relations
- Paper track: dataset description
- Paper status: Accept

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Gilles Sérasset | Université Joseph Fourier - Grenoble 1 | FR |
| Author 2 | Andon Tchechmedjiev | Université Joseph Fourier - Grenoble 1 | None |
| Main Contact | Gilles Sérasset | Université Joseph Fourier - Grenoble 1 | None |
Documentation:
Online at resource URL
Written Corpus
Language Type:
Multilingual
Languages:
English Finnish German Spanish French
Availability:
Freely Available
License:
Creative Commons
Size:
40 GByte
Production Status:
Newly created-finished
Use:
Language Modelling
- Paper title: European Union Language Resources in Sketch Engine
- Paper track: Written
- Paper status: Accept Oral

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Vít Baisa | Masaryk University | CZ |
| Author 2 | Jan Michelfeit | Lexical Computing Ltd | GB |
| Author 3 | Marek Medveď | Masaryk University | CZ |
| Author 4 | Milos Jakubicek | Lexical Computing | CZ |
| Main Contact | Vít Baisa | Masaryk University | None |
Documentation:
https://www.sketchengine.co.uk/eur-lex/
Written Corpus
Language Type:
Multilingual
Languages:
Dutch English German French Italian
Availability:
Freely Available
License:
<Not Specified>
Size:
884603 <Not Specified>
Production Status:
Existing-used
Use:
Word Sense Disambiguation
- Paper title: A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
- Paper track: General issues
- Paper status: Accept Poster

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | RAKHO Myriam | <Not Specified> | None |
| Author 2 | LAPORTE Éric | <Not Specified> | None |
| Author 3 | CONSTANT Matthieu | <Not Specified> | None |
| Main Contact | RAKHO Myriam | Université Paris-Est | FR |
Documentation:
http://homepages.inf.ed.ac.uk/pkoehn/publications/europarl.pdf
Language Type:
Multilingual
Languages:
German Greek Spanish French
Availability:
Freely Available
License:
As specified in the license.txt file found within the resource.
Size:
140 MByte
Production Status:
Newly created-in progress
Use:
Question Classification
- Paper title: A Multilingual Approach to Question Classification
- Paper track: Written
- Paper status: Accept Poster

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Aikaterini-Lida Kalouli | University of Konstanz | DE |
| Author 2 | Katharina Kaiser | University of Konstanz | DE |
| Author 3 | Annette Hautli-Janisz | University of Konstanz | DE |
| Author 4 | Georg A. Kaiser | University of Konstanz | DE |
| Author 5 | Miriam Butt | University of Konstanz | DE |
| Main Contact | Aikaterini-Lida Kalouli | University of Konstanz | None |
Documentation:
Publicly available in English within the resource.
Language Type:
Multilingual
Languages:
English Portuguese Spanish French Italian
Availability:
Freely Available
License:
open license
Size:
3000000 Documents Other
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications.
- Paper track: Written
- Paper status: Accept Poster

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Oussama Ahmia | IRISA | FR |
| Author 2 | Nicolas Béchet | IRISA | FR |
| Author 3 | Pierre-François Marteau | IRISA | FR |
| Main Contact | Oussama Ahmia | IRISA | None |
Documentation:
<Not Specified>
Written Corpus
Language Type:
Multilingual
Languages:
Arabic English German Spanish French
Availability:
Freely Available
License:
<Not Specified>
Size:
578 MByte
Production Status:
Existing-used
Use:
<Not Specified>
- Paper title: Jointly Learning to Embed and Predict with Multiple Languages
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Poster - Tuesday

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daniel C. Ferreira | Priberam, Instituto Superior Técnico | PT |
| Author 2 | André F. T. Martins | Priberam, Instituto de Telecomunicacoes | PT |
| Author 3 | Mariana S. C. Almeida | Priberam / Instituto de Telecomunicações | PT |
| Main Contact | Daniel C. Ferreira | Priberam, Instituto Superior Técnico | None |
Documentation:
<Not Specified>
Written Tagger/Parser
Language Type:
Multilingual
Languages:
American English German Mandarin Chinese Standard Arabic French
Availability:
Freely Available
License:
AGPL
Size:
20.5 MByte
Production Status:
Existing-used
Use:
Generic linguistic analyzer
- Paper title: The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction
- Paper track: Written
- Paper status: Accept Poster+Demo

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Gaël de Chalendar | CEA LIST | FR |
| Main Contact | Gaël de Chalendar | CEA LIST | None |
Documentation:
On the GitHub page
Language Type:
Multilingual
Languages:
Dutch English German French Italian
Availability:
Freely Available
License:
<Not Specified>
Size:
205245 <Not Specified>
Production Status:
Newly created-in progress
Use:
Translation Studies
- Paper title: Customization of the Europarl Corpus for Translation Studies
- Paper track: Written
- Paper status: Accept Poster

| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Zahurul Islam | <Not Specified> | None |
| Author 2 | Alexander Mehler | Goethe-Universität Frankfurt am Main; Universität Frankfurt | DE |
| Main Contact | Zahurul Islam | AG Texttechnology | DE |
Documentation:
<Not Specified>




